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Description 

The present invention relates to recombinant DMA sequences, vectors containing them, and a method for 
the use thereof. In particular, the present invention relates to recombinant DNA sequences which encode the 

5 complete amino acid sequence of a glutamine synthetase (GS) (L-gl utamate : ammonia ligase [ADP-forming] ; 
EC 6.3.1.2) and to the use of such nucleotide sequences. 

The ability of cloned genes to function when introduced into host cell cultures has proved to be invaluable 
in studies of gene expression. It has also provided a means of obtaining large quantities of proteins which are 
otherwise scarce or which are completely novel products of gene manipulation. It is advantageous to obtain 

10 such proteins from mammalian ceils since such proteins are generally correctly folded, appropriately modified 
and completely functional, often In marked contrast to those proteins as expressed In bacterial cells. 

Where large amounts of product are required, it is necessary to identify cell clones in which the vector sequ- 
ences are retained during cell proliferation. Such stable vector maintenance can be achieved either by use of 
a viral repiicon or as a consequence of integration of the vector into the host cell's DNA. 

15 Where the vector has been integrated into the host cell's DNA, the copy number of the vector DNA, and 
concomitantly the amount of product which could be expressed, can be increased by selecting for cell fines in 
which the vector sequences have been amplified after integration into the host cell's DNA. 

A known method for carrying out such a selection procedure Is to transform a host cell with a vector com- 
prising a DNA sequence which encodes an enzyme which is inhibited by a known drug. The vector may also 

20 comprise a DNA sequence which encodes a desired protein. Alternatively the host cell may be co-transformed 
with a second vector which comprises the DNA sequence which encodes the desired protein. 

The transformed or co-transformed host cells are then cultured in increasing concentrations of the known 
drug thereby selecting drug-resistant cells. It has been found that one common mechanism leading to the 
appearance of mutant cells which can survive In the increased concentrations of the otherwise toxic drug is 

25 the overproduction of the enzyme which is inhibited by the drug. This most commonly results from increased 
levels of its particular mRNA, which in turn is frequently caused by amplification of vector DNA and hence gene 
copies. 

It has also been found that, where drug resistance Is caused by an increase in copy number of the vector 
DNA encoding the InhibitaWe enzyme, there is a concomitant increase in the copy number of the vector DNA 
so encoding the desired protein in the host cell's DNA. There b thus an increased level of production of the desired 
protein. 

The most commonly used system for such co-amplification uses as the enzyme which can be Inhibited 
dihydrofolate reductase (DHFR). This can be inhfoited by the drug methotrexate (MTX). To achieve co-ampli- 
fication, a host cell which lacks an active gene which encodes DHFR Is either transformed with a vector which 

$5 comprises DNA sequences encoding DHFR and a desired protein or co-transformed with a vector comprising 
a DNA sequence encoding DHFR and a vector comprising a DNA sequence encoding the desired protein. The 
transformed or co-transformed host cells are cultured in media containing increasing levels of MIX, and those 
cell lines which survive are selected. 

Other systems for producing co-amplification have been employed. However, none of them has been as 

40 widely used as the DHFR/MTX system. 

The co-amplification systems which are at present available suffer from a number of disadvantages. For 
instance, it is generally necessary to use a host cell which lacks an active gene encoding the enzyme which 
can be Inhibited. This tends to limit the number of cell lines which can be used with any particular co-amplifi- 
cation system. For Instance, there is at present only one ceil line known which lacks the gene encoding DHFR. 

45 it would be advantageous if an effective co-amplification system based on a dominant selectable marker which 
was applicable to a wide variety of cell lines could be provided. This would allow exploitation of the different 
processing and growth characteristics of a variety of ceil lines. 

Attempts to use DHFR genes as dominant selectable markers in other cell lines has not proved entirely 
satisfactory. For instance, a MTX-resistant mutant DHFR or a DHFR gene under the control of a very strong 

50 promoter can act as a dominant selectable marker in certain ceil types but such high concentrations of MTX 
are required that it has not been possible to achieve very hig h copy numbers by selection for gene amplification. 

Co-transformants with an additional selectable marker also have disadvantages. For instance, this can 
Increase the complexity of piasmid construction and requires additional time-consuming screening of transfor- 
med ceils to distinguish those clones in which the DHFR gene is active. 

55 A further disadvantage of the known co-amplification systems is that the DNA sequence encoding the 
inhlbltable enzyme is generally not under posMranslational control. The enzyme in the amplified system is 
therefore produced in large quantities, together with the desired protein. This could lead to lower levels of pro- 
duction of the desired protein. 
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Another disadvantage of known co-amplification systems Is that resistance to the known drug can arise 
from mechanisms other than amplification. For instance, in the DHFR/MTX system, it Is possible for a mutant 
DHFR gene to arise which produces a mutant DHFR which has a lower binding affinity for MTX than does wild- 
type DHFR If such mutant DHFR arises, cells containing the gene which encodes it wBI be more resistant to 
5 MTX than the original host ceO and wll therefore be selected, even though no amplification has taken place. 
It is possible to select further to eliminate lines in which no amplification has taken place, but this is a time con- 
suming process. 

A further disadvantage of previous selection systems for gene amplification Is that toxic drugs are required. 
In particular MTX Is a potential carcinogen. 
10 An additional disadvantage of previous amplication systems is the need for repeated, time-consuming 
rounds of amplification, for example three or more, to obtain maximum copy number. 

It Is an object of the present Invention to overcome at least in part the disadvantages of the prior art systems 
for co-amplification. 

According to a first aspect of the present Invention there is provided a method for co-amplifying a reconv 
15 binant DNA sequence which encodes the complete amino acid sequence of a desired protein other than a 
glutamine synthetase (OS), which method comprises : 

(a) providing a vector capable, in a transfbnnant host ceil, of expressing both a recombinant DNA sequence 
so which encodes an active GS enzyme and the recombinant DNA sequence which encodes the complete 

amino acid sequence of the desired protein other than GS ; 

(b) providing a eukaryotic host cell which Is a glutamine prototroph ; 

(c) transforming said host cell with said vector ; and 

(d) culturing said host cell under conditions which allow transformants containing an amplified number of 

29 copies of the vector-derived GS-encodlng recombinant DNA sequence to be selected, which transfor- 
mants also contain an amplified number of copies of the desired protein-encoding DNA sequence. 

According to a second aspect of the present Invention, there is provided a method for co-amplifying a 

30 recombinant DNA sequence which encodes the complete amino acid sequence of a desired protein other than 
a GS, which method comprises : 



(a) providing a first vector capable, in a trensfbrmant host cell, of expressing a recombinant DNA sequence 
35 which encodes an active GS enzyme ; 

(b) providing a second vector capable, in a transformant host cell, of expressing the recombinant DNA sequ- 
ence which encodes the complete amino acid sequence of the desired protein other than GS ; 

(c) providing a eukaryotic host cell which is a glutamine prototroph ; 

(d) transforming said host ceil with both said first and said second vectors ; and 

40 (e) culturing said host cell under conditions which allow transformants containing an amplified number of 
copies of the vector-derived GS-encoding recombinant DNA sequence to be selected, which transfor- 
mants also contain an amplified number of copies of the desired protein-encoding DNA sequence. 

45 According to a third aspect of the present invention, there is provided a method for using a vector as a domi- 
nant selectable marker in a cotrensfonmation process which comprises : 



(a) providing a vector capable, in a transformant host cell, of expressing a recombinant DNA sequence 
60 which encodes an active GS enzyme and a recombinant DNA sequence which encodes the complete 

amino acid sequence of a desired protein other than GS ; 

(b) providing a eukaryotic host cell which is a glutamine prototroph ; 

(c) transforming the host cell with the vector ; and 

(d) selecting transformant cells which are resistant to GS Inhibitors, 

65 

whereby transfbnnant ceils are selected in which the vector-derived GS-encoding sequence serves as 
a dominant selectable and co-amplifiable marker. 
According to a fourth aspect of the present invention, there is provided a method for using a vector as a 
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dominant selectable marker In a cotransformation process which comprises : 



(a) providing a vector capable. In a transformant host cell, of expressing a recombinant DNA sequence 
5 which encodes an active GS enzyme ; 

(b) providing a second vector capable, In a transformant host cell, of expressing a recombinant DNA sequ- 
ence which encodes the complete amino acid sequence of a desired protein other than GS ; 

(c) providing a eukaryotic host which is a glutamine prototroph ; 

(d) transforming said host cell with both said first and second vectors ; and 
10 (e) selecting transformant ceils which are resistant to GS inhibitors, 



whereby transformant ceils are selected in which the vector-derived GS-encoding sequence serves as 
a dominant selectable co-amplifiable marker. 
According to a fifth aspect of the present Invention, there is provided a recombinant DNA vector comprising: 

15 



(a) a recombinant DNA sequence which encodes the complete amino acid sequence of a GS ; and 

(b) a recombinant DNA sequence which encodes the complete amino acid sequence of a desired protein 
other than said GS, 

20 

the vector being capable, in a transformant host cell, of expressing both said recombinant DNA sequ- 
ences (a) and (b). 

Typically, in each aspect of the invention, the GS-encoding recombinant DNA sequence encodes an 
eukaryotic, preferably mammalian, GS. Conveniently, the GS-encoding recombinant DNA sequence encodes 
25 a rodent, such as mouse, rat or especially hamster, GS. 

Preferably, the GS-encoding recombinant DNA has the sequence of the amino acid coding portion of the 
sequence shown in Figure 2, and most preferably comprises the whole recombinant DNA sequence shown in 
Figure 2. 

Glutamine synthetase (GS) is a universal housekeeping enzyme responsible for the synthesis of glutamine 
so from glutamate and ammonia using the hydrolysis of ATP to ADP and phosphate to drive the reaction. It is invol- 
ved in the integration of nitrogen metabolism with energy metabolism via the TCA cycle, glutamine being the 
major respiratory fuel for a wide variety, possible the majority, of cell types. 

GS Is found at a low level (0.01 %-0.1% of soluble protein) in most higher vertebrate cells and is found at 
higher levels ( > 1% of total protein) in certain specialised cell types such as hepatocytes, adipocytes and glial 
35 cells. 

A variety of regulatory signals affect GS levels within cells, for instance glucocorticoid steroids and cAMP, 
and glutamine in a culture medium appears to regulate GS levels post-translationaOy via ADP ribosylation. 

GS from all sources is subject to inhibition by a variety of inhibitors, for example methionine sulphoxhnine 
(Msx). This compound appears to act as a transition state analogue of the catalytic process. Extensively 
40 amplified GS genes have been obtained (Wilson R.H., Heredity, 49, 181, 1982 ; and Young A.P. and Ringold 
G.M., J. Biol. Chem., 258, 11260- 11266, 1983) in variants of certain mammalian cell lines selected for Msx 
resistance. Recently Sanders and Wilson (Sanders P.G. and Wilson R.H., The EM BO Journal, 3, 1, 65-71, 
1984) have described the cloning of an 8.2 kb Bglll fragment containing DNA coding for GS from the genome 
of an Msx resistant Chinese hamster ovary (CHO) cell line KGIMS. However, this fragment does not appear 
45 to contain a complete GS gene and It was not sequenced. 

Conveniently, the GS-encoding recombinant DNA sequence is cDNA, preferably derived by reverse tran- 
scription. However, the GS-encoding recombinant DNA sequence may alternatively or additionally comprise 
a fragment of genomic DNA. 

Preferably, the vector is an expression vector capable, in 8 transformant host cell, of expressing both the 
50 GS-encoding recombinant DNA sequence and the desired protein-encoding recombinant DNA sequence. 

Preferably, the GS-encoding recombinant DNA sequence is under the control of a regulatable promoter, 
such as a heat shock or a metallothionein promoter. 

The present invention also provides a host cell transformed with a vector according to the fourth aspect of 
the invention. 

55 There are a number of advantages to the methods of the present invention in co-amplification of non-seleo- 
ted genes. 

An advantage is that the GS gene is regulatable, for instance by addition of glutamine to the medium. It is 
therefore possible to amplify the GS gene and the non-selected gene, and then down-regulate the GS gene. 
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The host ceil wll then accumulate much smaller quantities of active GS whOe stDI producing desirably large 
quantities of the required product This also has the advantage of increasing the stabiity of the cell line, since 
there will be less selection pressure which could otherwise lead to Instability In maintenance of amplified sequ- 
ences in the cell line if the inhibitor is removed 

6 . It has been surprisingly and unexpectedly shown that GS expression vectors can also be used as effective 
selectable markers in cell lines which contain an active endogenous GS gene by conferring resistance to certain 
levels of Max at which the frequencies of resistance caused by endogeneous gene amplification is minimal. It 
has been shown that such vectors can be amplified by Increasing the concentration of Max In the cell lines so 
that high copy numbers are achieved These copy numbers are higher than achieved using previous amplffi- 

10 cation systems such as DHFR/MTX and are achieved in only two rounds of amplification. The possibility of 
attaining very high copy numbers Is advantageous In ensuring that high levels of mRNA encoding the desired 
protein are obtained. 

It Is believed, although the Applicants do not wish to be limited by this theory, that the effectiveness of GS 
as an amplifiable selectable marker is a consequence of the relative expression levels of endogeneous- and 
is vector-derived GS genes. Selection for gene amplification using Msx leads almost exclusively to the isolation 
of dones in which the vector-derived GS gene has been amplified in preference to the endogeneous gene. 
When using host cells containing an endogeneous active GS gene, it is possible to facilitate selection by reduo- 
Ing or abolishing endogeneous GS activity, for instance by treatment of the celt line with dibutyryt-cAMP and 
theophylline. A ceil line which is susceptible to such reduction or abolition is the 3T3-L1 cell line. 
20 The desired protein whose recombinant DNA sequence is co-amplified may be, for instance, tissue plas- 
minogen activator (tPA), although this technique can be used to co-amplify recombinant DNA sequences which 
encode any other protein, such as immunoglobulin polypeptides (Igs), human growth hormone (hGH) or tissue 
inhibitor of metalloproteinases (TIMP). 

Preferably, the amplification is achieved by selection for resistance to progressively increased levels of a 
25 GS inhibitor, most preferably phosphinothricin or Msx. 

A further advantage of the present oo-ampllflcation procedure is that Msx is a cheaply available product 
of high solubility. It can therefore readily be used at high concentrations to enable selection of lines containing 
highly amplified sequences. 

Moreover, the effect of Msx can be potentiated by the addition to the selection medium of methionine. It is 
so therefore preferred that In the present co-amplification procedure, selection is carried out in a medium contain- 
ing methionine at higher than usual levels. Similarly, the effect of Msx can be potentiated by lower levels than 
usual of glutamate. 

If the GS-encoding recombinant DNA sequence in the vector used for co-amplification is under the control 
of a regulatable promoter, It is preferable for expression of the GS sequence to be switched on during selection 
35 and amplification and subsequently down-regulated. 

In some cases, after co-amplification, the selected cell line may be dependent to some extent on the GS 
inhibitor used in the selection procedure. If this is the case, the amount of GS Inhibitor required may be reduced 
by adding glutamine to the culture medium whereby GS activity Is post-translationally suppressed. 

The host cells which are used In the methods of the present invention contain an active GS gene. For the 
40 reasons set out above, It has been found that selection can still be achieved even where an active endogeneous 
gene is present The advantages of using the vector of the present invention In co-amplification procedures 
are also shown In the use of the vectors as selectable markers. 

It Is preferred that the host cells used for the co-amplification procedures or selection procedures of the 
present invention are mammalian, most preferably hamster, cells. Chinese hamster ovary (CHO)-KI cells or 
4$ derivatives thereof are particularly suitable. 

It is therefore believed that the use of recombinant DNA sequences encoding GS, for instance in vectors 
for co-amplification or selection, will lead to highly flexible and advantageous systems which will be surprisingly 
superior to other similar systems, for instance based on DHFR/MTX. 

The present invention Is now described, byway of example only, with reference to the accompanying draw- 
60 Ings, In which: 

Figure 1 shows restriction maps of the GS specific cONA inserts in pGSC45, Xgs 1.1 and Xgs 5.21 dones, 
in which it can be seen from the arrows that the nucleotide sequence of the coding region of GS was pre- 
dominantly obtained from M13 subclones of Xgs 1.1 and various regions confirmed using subclones of Xgs 
521 and pGSC45 ; 

55 Figure 2 shows the cDNA (a :) and predicted amino acid (b :) sequences for the Chinese hamster GS gene, 
together with the published peptide sequences (c :) and peptide designations (d :) of bovine brain GS. The 
sequence (e :) indicates the polyadenyiation site used in Xgs 1.1. Amino acid residues are indicated as 
their single letter codes ; non-homologous bovine residues are indicated in lowercase letters. The 'A* below 
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base 7 represents the start of the p6SC45 insert and the ' — ' marker represents the priming sequence in 
Xgs 1.1 complementary to residues 1 135-1 132. The '>' and '< ' symbols represent bases involved in stems 
of the calculated structure for the 5' untranslated region ; 

Figure 3 shows the structure of three GS expression plasmids in which a) shows plasmid pSVLGS.1 (8.5 

5 kb) containing a 4.75 kb GS mlnigene under the control of the late region promoter of SV40 (L) cloned in 

the bacterial vector pCT54. The GS sequences include the complete coding sequence, a single intron and s 

approximately 2 kb of 3'-flanWng DNA spanning both of the presumed sites of polyadenylation, (b) shows 

plasmid pSV2.GS (5.5 kb) containing 1 J2 kb of GS cONA under the control of the early region promoter of 

SV40 (E), the intron from the t-antigen gene of SV40 and a sequence containing the early region polyadeny- % 

10 lation signal of SV40, and (c) shows plasmid pZIPGS (1225 kb) containing the Htndlll-BamHI fragment 
from pSV2.GS (containing the GS coding sequence and SV40 intron and polyadenylation signal) cloned 
in the retroviral vector pZJP Neo SV(X) in which hatched blocks indicate irrelevant mouse DNA sequences, 
5' and 3' LTRs are the long terminal repeats of Moloney Murine Leukaemia Virus (MMLV), the filled block 
represents an SV40 fragment spanning the origin of replication oriented such that the SV40 early region 

15 promoter directs the expression of the gene from a transposon which confers resistance to G418 in mam- 
malian cells (neo) and unmarked blocks contain additional DNA sequences from MMLV ; 
Figure 4 shows Southern blots of cell lines transfected with pSVLGS.1 (Panel A) or pSVZGS (Panel B). 
The blot is probed with an RNA probe specific for SV40 origin-region DNA. Panel A represents a 2 hour 
exposure. Each lane contains 2.5 pg genomic DNA from the following cell lines. Lanes 1 to 3 contain DNA 

20 from initial transfectants : lane 1 , SVLGS2 ; lane 2. SVLGS5 ; lane 3, SVLGS9. Lanes 4 to 6 contain DNA 
from cell lines obtained after a single round of selection for gene amplification with Msx : lane 4 SVLGS2 
(500 jiMR) ; lane 5, SVLGS5 (250 pMR) ; lane 6, SVLGS9 (500 pMR). Lane 7 contains DNA from a cell 
line subjected to 2 rounds of selection for gene amplification, SVLGS5 (2mMR). Panel B is an exposure 
of approximately two weeks. Each of lanes 1 to 6 contain 5 pg of genomic DNA and lane 7 contains 2.5 

25 pg. Lanes 1 to 3 contain DNA from initial transfectant cell lines : lane 1, SV2.GS20 ; lane 2, SV2.GS25 ; 
lane 3, SV2.GS30. Lanes 4 to 6 represent cell lines after one round of selection in higher concentrations 
of Msx : lane 4, SV2.GS20 (1 00 pMR) ; lanes SV2.GS25 (500 pMR) ; lane 6, SV2.GS30 (500 pMR). Lane 
7 represents a cell line obtained after two rounds of selection in Msx, SV2.GS30 (10mMR). 
Figure 5 shows a primer extension analysis of RNA derived from cell lines transfected with pSVLGS.1. A 

30 DNA oligonucleotide which binds to RNA at the presumed translation "start" was used to synthesise DNA 
from total RNA preparations. RNA preparations shown are : lane 1, SVLGS2 ; lane 2, SVLGS5 ; lane 3 a 
derivative of CHO-K1 resistant to 30 pM Msx (to indicate the extension from wild-type GS mRNA) ; MW, 
pAT153 digested with Hpall molecular weight markers. 

In the nucleotide and amino acid sequences shown in the accompanying drawings and in the description, 
35 the following abbreviations are used as appropriate. U ° uridine ; G = guanosine ; T = thymidine ; A * adeno- 
sine ; C ■ cytosine ; *** - a termination codon ; — denotes an unknown nucleotide residue ; A = alanine ; C = 
cysteine ; D = aspartic acid ; E = glutamic acid ; F « phenylalanine ; G = glycine ; H = histidine ; I c isoleuclne; 
K a lysine ; L « leucine ; M = methionine ; N = asparagine ; P = proline ; G ■ glutamine ; R = arginine ; S = 
serine ; T = threonine ; V = valine ; W = tryptophan ; Y » tyrosine ; X = an unknown amino acid ; PBS » phos- 
40 phate buffered saline ; SDS - sodium dodecyi sulphate ; and EDTA = ethylene diamine tetraacetic acid. 

Example 

Using a multi-step selection procedure In a glutamine-free medium, a mutant line was derived from the chin- 
45 ese hamster ovary (CHO) KG1 cell line (itself a derivative from the CHO-K1 line obtained as CCL 61 from the 
American Type Culture Collection, Rockvlle, MD, USA). The mutant cell line, labelled CHO-KG1 MS, is resistant 
to 5mM Msx. (The parental cell line KG1 is only resistant to 3 pM Msx). 

A subclone, KG1 MSC4-W, of the mutant cell line was used as a source of cellular DNA. Cells from the sub- 
clone were washed in PBS after trypsinization and pelleted at2000 r.p.m. fbr4 min. The pellet was resuspended 
so in 100 mM Tris-HCU pH 7.5. 10 mM EDTA and lysed by the addition of SDS to 2%. RNase A was added to 
50 pg/ml and the solution incubated at 37°C for 30 min. Protease K was added to 50 pg/ml and incubation con- 
tinued at 37°C for from 30 min to 1 hr. The solution was phenol extracted twice followed by two chloroform : * 
boamyl alcohol (24 : 1) extractions. The DNA was precipitated with Isopropanol and then resuspended in 2 
mM EDTA, 20 mM Tris-HCI , pH 7.5 and stored at 4°C. 
65 Genomic DNAs from parental KG1, mutants KG1MS and KG1MSC4-M, and revertant KG1MSC4-0 cells 
were digested with a variety of restriction endonudeases, subjected to agarose gel electrophoresis and South- 
em blotted onto nitrocellulose filters. These blots were probed with ollgo (<n>prlmed cDNA made from parental 
KG1 and mutant KG1MSC-4M poly(A) mRNAs. When wild-type KG1 cONA was used as a probe, a series of 
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identical bands was seen across tracks from aD cell lines. When KG1MSC4-M mutant cDNA was used as a 
probe, the same common bands were seen across all tracks together with unique bands specific to mutant 
KG1MS and KG1MSC4-M genomic DNA. The bands common to all genomic DNAs were shown to be due to 
mitochondrial (mt) DNA, as determined by restriction enzyme analysis of mtDNA purified from KG1 cells. The 
5 smallest DNA fragment identified which could contain the whole of the presumptive coding sequence for GS 
is an &2-kb Bglll fragment On double digestions with Pstl and Bgill. the two Pstl fragments (2.1 kb and 2.4 
kb) are seen to remain intact. Indicating that both Pstl fragments are contained within the BGHI fragment 

30 ug of KG1MSC4-M DNA was digested to completion with Bglll and the fragments separated by 
electrophoresis on an 0.8% agarose gel. The amplified 8.2 kb band was identified using ethldium bromide stain- 
to ing and long wave ultra violet radiation by comparison with XHindlll and mtPstl digests. The DNA band was 
eluted Into a well cut into the gel and purified by phenol extraction, chloroform extraction and ethanol precipi- 
tation using carrier tRNA. Purified DNA was ligated with BamHWigested, bacterial alkaline phosphatase-trea- 
ted pUC9 (Vieira, J. and Messing, J., Gene, 19, 259-268, 1982). Recombinant DNA was used to transform E. 
Coll to ampicfllin resistance and white colonies on Xgal picked for analysis. 
is 150 recombinant clones were obtained and DNA analysis of 11 of these showed that they all had DNA 
inserts of about 8.0 kb. Differential colony hybridization and DNA spot hybridizations Identified two recombinant 
clones which gave strong hybridization with a mutant KG1MSC4-M cDNA probe but no signal with a parental 
KG1 cDNA probe. Both recombinants pGS1 and pGS2 produced the Pstl restriction pattern expected from 
insertion of the required Bglll restriction fragment pGS1 DNA was used to hybrid select GS mRNA from total 
20 cytoplasmic and poly(A) KG1MSC4-M RNA. The selected mRNA was translated together with KG1 and 
KG1MSC4-M total cytoplasmic RNA and [**S] methbnine-labelled polypeptides separated by SDS-PAGE. The 
major translation product of pGS1 selected mRNA is a polypeptide of 42 kD MW which co-migrates with an 
amplified polypeptide in KG1 MSC4-M translations. pGS1 therefore contains genomic CHO DNA which contains 
at least part of the GS gene. This part of the work was carried out as described by Sanders and Wilson (loc 
25 cit). 

A 3.5 kb Hindi II fragment containing the 3' end of the GS gene from KG1MSC4-M was subcloned from 
pGS1 into pUC9 to form piasmid pGS113. A clone bank was prepared by cloning a Sau3A partial digest of 
KG1MSC4-M into the BamHI site of XL47. Recombinants were selected for hybridisation to pGS1 . A BamHI- 
EcoRI fragment from a selected U.47 recombinant was subcloned into pUC9 to form piasmid pGS2335 (Hay- 

30 ward et at, Nuc. Acid Res., 14, 999-1008, 1986). 

cDNA libraries were made from KG1 MSC4-M mRNA In pBR322 and Xgtl 0 using standard procedures. The 
mRNA was converted to cDNA using oligo-dT primed reverse transcriptase, and dsDNA made by the RNase 
H procedure (Gubler, U. and Hoffmann. V., Gene, 25, 263-269, 1983). The dsDNA was either tailed with C resi- 
dues (Michelson, A.M. and Orfcln, S.H., J. Biol. Chem., 257, 14773-14782, 1982), annealed to G-tatled pBR322 

35 and transformed into E. coll DH1 , or methylated and ligated to EcoRI linkers. Linkered DNA was digested with 
EcoRI and the linkers removed by Sephadex G75 chromatography in TNES (0.14 M NaCI, 0.01 M Tris, pH 7.6, 
0.001 M EDTA, 0.1% SDS). Linkered DNA in the excluded volume was recovered by ethanol precipitation and 
annealed to EcoRI-cut gt1 0 DNA. Following in vitro packaging, recombinant phage was plated on the high fre- 
quency lysogeny strain E. coll Hfl (Huyhn, T.V., Young RA and Davis, R.W., in "DNA cloning techniques II : 

40 A practical approach (Ed. Glover, D.M.), I.R.L Press Oxford, 1 985). About 5000 colonies and 20000 plaques 
were screened on nitrocellulose filters using nick-translated probes derived from pUC subclones of GS genomic 
sequences. A 1 kb EcoRi-Bglll fragment from pGS2335 was used as a 5' probe, and the entire 3.5 kb Hindlll 
fragment of pGS 1 1 3 was used as a 3' probe. Plasmids from positive colonies were analysed by restriction diges- 
tion of small-scale preparations of DNA and the longest clone (pGSC45) selected for further analysis. 

45 Positive X clones were plaque purified, grown up in 5000 ml of E.coii C600 liquid culture, and the phage 
purified on CsCi step gradients. DNA was prepared by formamide extraction (Davis, R.W., Bostein, D. and Roth, 
S.R., Advanced Bacterial Genetics, Cold Spring Harbor, 1980). Clones with the longest inserts were identified 
by EcoRI digestion and inserts subcloned into pAT153 and M13mp11 phage for further analysis and sequenc- 
ing. 

so The colonies or plaques were screened first with a probe derived from the 5' end of the GS gene. Positive 
colonies or plaques from this analysis were picked and rescreened with a longer probe covering most of the 
3' end of the gene. In this way It was anticipated that clones with long or possibly full length inserts would be 
selected and the tBdious rescreenlng for 5' ends would be avoided. Several piasmid clones and XgtlO recom- 
binants were derived by this means. Further analysis of one of the piasmid clones (pGSC45) by restriction 

55 enzyme digestion and partial sequencing revealed that it had an insert of about 2.8 kb and a polyA sequence 
at the 3' end. Northern blots indicate that a major mRNA for GS Is about this size (Sanders and Wilson, (loc. 
tit)), so the insert in pGSC45 was potentially a full length copy of this mRNA. The two clones (Jtga 1 .1 and Xgs 
5.21) had inserts of 1450 bp and 1 170 bp respectively. Restriction maps and alignment of the cDNA Inserts in 
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pGSC45. Xgs 1 .1 and Xgs 5.21 are shown in Figure 1 . tt is clear that the inserts in the X clones are considerably 
shorter at the 3' end than the plasmid done and may represent cDNA copies of one of the minor mRNAs. The 
insert In Xgs.1.1 extends some 200 base pairs at the 5' end. 

The nucleotide sequence of the mRNA coding for glutamine synthetase was obtained from M13 subclones 

5 of pOSC45 and EcoRJ subclones of Xgs 1 .1 and Xgs 5.21 and is shown in Figure 2. Some confirmatory sequ- 
ence was also obtained from the genomic done p6S1. Primer extension of OS mRNA with an oiigonudeotide 
complementary to nudeotides 147-166 gave a major extension product of 166 nudeotides. This shows that 
p6SC45 only lacks six or seven nudeotides from the 5' end of the mRNA. Nudeotide sequencing of the primer 
extended product by Maxam-GSbert sequencing confirmed this although the first two bases could not be deter- 

10 mined. 

Sequences at the 5' end of Xgs 1.1, which is some 200 bases longer at the 5' end than pGSC45, showed 
considerable inverted homology to sequences at the 3' end of this done (which was about 150 bases shorter 
at the 3' end than Xgs 5.21, (see Fig. 1). These additional sequences are probably doning artefacts, arising 
during second strand synthesis due to nudeotides 6 to 1 priming DNA synthesis via their complementarity to 

is nucelotides 1132-1 137 despite the fact that the RNase H procedure was used. It cannot be exduded that the 
duplication arises from transcription of a modified GS gene, produdng a modified mRNA which has been sub- 
sequently doned, although the primer extension results did not suggest that there was any major mRNA species 
with a 5' end longer than 166 nudeotides. 

The predicted amino add sequence for CHO glutamine synthetase is shown in Figure 2. The NH 2 terminus 

20 was identified by homology with the NH 2 terminal peptide found in bovine brain glutamine synthetase (Johnson, 
R.J. and Piskiewicz, D„ Biochem. Btophys. Acta, 827, 439-446, 1985). The initiating AUG follows a precise 
CCACC upstream consensus sequence found for true initiation codons and is followed by a purine (Le. CCAC- 
CATGG). (Another AUG codon at position 14 is not in a favourable context by the same criteria and is followed 
by a termination codon in frame 21 nudeotides downstream.) The predicted amino add composition of the GS 

6 protein gives a molecular weight of 41 ,964 (not allowing for N-terminal acetylation or other post-translational 
modifications), in agreement with other estimates. The basic nature of the protein is reflected in the excess of 
arginine, histidine and lysine residues over those of aspartate and glutamate. 

The predicted amino add sequence shows excellent homology with bovine and other GS derived peptide 
sequences obtained by peptide sequendng, indicative of an accurate DNA sequence. (The amino acid sequ- 
30 ence allows the ordering of all the cyanogen bromide peptides and most of the tryptic peptides published for 
bovine GS). 

The CHO sequence also shows some homology with the GS sequence from the cyanobacterium 
Anabaena, notably at residues 31 7-325, (NRSASIRIP), which are an exact match to Anabaena residues 342- 
350. In addition, related sequences can be found in glutamine synthetases isdated from plants. 
35 Access to complete cDNA dones and genomic dones for Chinese hamster GS has not only allowed the 
amino add sequence of glutamine synthetase to be predicted, but also allows a detaled analysis of the position 
of the introns within the gene and their relationship to the exons coding for the structural domains of the protein. 

A GS minigene was constructed from a cDNA sequence (spanning the majority of the protein coding region) 
and a genomic sequence (which recreates the 3' end of the coding sequence). The 3.4 kb EcoRI-Sstl fragment 
40 of pGS1 encodes a single intron, all of the 3' untranslated region of both mRNAspedes identified and contains 
about 2 kb of 3' flanking DNA. This DNA fragment was doned between the EcoRI and BamHI sites of pCT54 
(Emtage et a!., PNAS-USA, 80, 3671 -3675, 1 983) to create pCTGS. The 0.8 kb EcoRI fragment of Xgs 1 .1 was 
then inserted at the EcoRI site of pCTGS in the correct orientation to recreate the 5' end of the gene. The late 
promoter of SV40 was doned upstream by inserting the 342 bp Pvull — Hindlll fragment of SV40, containing 
45 the origin of replication, at the Hindlll site of the above plasmid in the appropriate orientation to produce plasmid 
pSVLGS-1 which is shown in Figure 3(a). 

An alternative GS expression construct was made by pladng cDNA containing all of the GS coding sequ- 
ences between sequences from SV40 which direct effident expression in mammalian cells. The 1.2 kb Nael- 
Pvull fragment of Xhgs 1.1 was doned in place of dhfr sequences In pSVZdhfr, (Subramani, S., Mulligan, R. 
50 and Berg, P., Md. Cell. Bid., 1, 854-864, 1981) between the Hindlll and Bglll sites to form pSVZGS which Is 
shown In Figure 3(b). 

In order to place GS coding sequences under the control of the Mdoney murine leukaemia vims (MMLV) 
LTR promoter, the Hindi! — BamHI fragment from pSV2.GS (see Figure 3b) was introduced at the BamHI site 
of pZIP-NeopSV(X) (Cepko, C.L, Roberts, B.E. and Mulligan, R.C., CeQ, 37, 1053-1082, 1984). 
55 The 3.0 kb Hindlll — BamHI fragment of ptPA 3.16 (Stephens, P.E^ Bendig, M.M. and Hentschel, C.C, 
manuscript In preparation) contains a cDNA coding for tissue plasminogen activator, downstream of which Is 
the SV40 small t-intron and the pdyadenytation signal from the early region transcript of SV40. This fragment 
was doned in a 3-way ligation with the 342 bp SV40 Pvull— Hindlll fragment into the BamHI site of pSVLGS.1 
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so that the tPA gene was under the control of the SV40 early promoter. This generated two plasmlds, 
pSVLGStPA16, In which the GS and tPA transcription untts are in tandem, and pSVLGStPA17, in which the 
two genes are in opposite orientations. 

CHO-K1 ceOs, obtained from ATCC, were grown In Glasgow modified Eagle's medium (GMEM) without 

6 gJutamlne and supplemented with 10% dlalysed foetal calf serum (GIBCO), 1 mM sodium pyruvate, non-essen- 
tial amino acids (alanine, aspartate, glycine and serine at 100 nM, asparaglne, glutamate and proline at 500 
>iM) and nucleosides (adenosine, guanoslne, cytidine and uridine at 30 pM and thymidine at 10 nM). For selec- 
tion, L-methlonine sulphoximine (Msx from Sigma) was added at appropriate concentrations. Approximately 3 
x 10 s cells per 100 mm petri dish were trensfected with 10 jig circular plasmid DNA according to the calcium 

10 phosphate co-preclpitation procedure (Graham, F.L and van der Eb, A.J., Virology, 52. 456-467, 1 983). Cells 
were subjected to a glycerol shock (15% glycerol In serum-free culture medium for 2 minutes) 4 hours after 
transfection (Frost, E. and Williams, J. f Virology, 91, 39-50, 1978). One day later, transf acted cells were fed 
with fresh selective medium and colonies of surviving cells were visible within 2-3 weeks. 

tPA activity in cell culture supematants was measured using a fibrin-egarose plate assay using a tPA sten- 
ts dard (Biopool) for comparison. Attached cells were typically washed In serum-free medium and incubated for 
18-20 hours in serum-free medium at 37°C. After removal of medium samples for assay, the cells were tryp- 
sinised and viable ceDs counted. Results were then expressed as units of tPA/10 6 cells/24 hours. Colonies of 
cells in petri dishes were assayed for tPA production by overlaying directly with a fibrin agarose gel. 

In the giutamine-free medium used In these experiments, the specific GS inhibitor, Msx, is toxic to CHO-K1 

20 cells at concentrations above 3 jiM. To test whether the GS expression plasmlds could synthesise functional 
GS In vivo , each plasmid was introduced Into CHO-K1 cells by calcium phosphate — mediated transfection 
and tested for the ability to confer resistance to higher concentrations of Msx. 

Resistance to Msx can, however, also arise by amplification of the endogenous GS genes (or perhaps by 
other unknown mechanisms). Therefore, In order for a GS expression vector to be useful as a dominant select- 

25 able marker, it must confer resistance to a particular concentration of Msx with a greater frequency than the 
frequency of spontaneously resistant mutants. The frequency with which spontaneously resistant clones are 
detected depends on the concentration of Msx used for selection. Thus, for instance gene amplification in CHO 
K1 cells leads to approximately 1 surviving colony/10 4 cells plated in 10 jiM Msx, but this frequency declines 
to less than 1/10 7 if cells are selected for resistance to 25 uM Msx. 

30 Since the frequency of transfection of CHO cells using the calcium phosphate co-precipitate technique is 
generally reported to be less than 1/1 0 3 , a range of Msx concentrations was chosen for selection in excess of 
10 uM. The results in Table 1 show that transfection with any of the three GS expression plasmlds leads to 
survival of a greater number of Max-resistant colonies than the background frequency detected in mock- 
trensfected ceOs when selected at 15 uM or 20 jiM Msx. 

35 pZIPGS yields only a slight increase in the number of surviving colonies above background. This vector 
would therefore be a poor selectable marker and was not studied further. pSV2.GS and pSVLGS.1 , however, 
both appear to act as effective dominant selectable markers in this cell line. The frequency with which resistant 
colonies arise after transfection with either plasmid in these experiments is at least 25 times the frequency due 
to endogenous amplification if selection is carried out at 15-20 *iM Msx. Apparent transfection frequencies for 

40 pSV2.GS of up to 3.8/1 0 8 cells and for pSVLGS.1 of up to 2.5/1 0 s ceils were observed. The differences in appa- 
rent transfection frequencies between the three plasmlds are likely to reflect differences in the efficiency with 
which the GS gene is expressed in the above three vectors. 

An independent estimate of transfection efficiency can be obtained in the case of pZIPGS since the vector 
also contains a neo gene which confers resistance to the antibiotic G418. Selection with G418 instead of Msx 

45 yielded a transfection frequency substantially higher than obtained by selection In 14-20 pM Msx (see Table 
1), indicating that the vector is being taken up by the cells and reinforcing the view that the GS gene is relatively 
poorly expressed in this vector. 
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In order to confirm that the generation of Max-resistant colonies is due to expression of transfected GS 
genes, rather than to some non-specific effect of the Input DNA, there are three predictions which can be tested. 
Firstly, the Msx-resistant cells should contain vector DNA. Secondly, novel GS mRNAs should be produced in 
these ceU lines, since the heterologous promoters used wOl direct the formation of GS mRNAs which differ in 
5 length at the 5' end from the natural GS mRNA. Thirdly, active transfected GS genes should be amplifiable by 
selection in increased concentration of Max. These predictions were therefore tested as follows. 

Three cefl lines were established from individual colonies arising after transfection with pSVGS.1 and three 
cell lines from colonies transfected with pSV2.GS. Cell lines SVLGS 2 and SVLGS 5 are resistant to 20 pM 
Msx and SVLGS 9 to 30 pM Msx. Cell lines SV2.GS20, SV2.6S25, and SV2.GS30 are resistant to 20, 25 and 
10 30 pM Msx respectively. 

DNA was prepared from each of these cell lines and a Southern blot of the DNA samples was hybridised 
with an RNA probe specific for SV40-ORI region DNA. The result shown in Figure 4. indicates that all of the 
Msx-resistant cell lines contain vector DNA. The number of copies of the vector present in each eel) can be 
estimated by comparison with known amounts of 8 standard preparation of vector DNA, loaded on the same 
1 5 gel. From this, it is dear that all of the SVLGS cell lines contain multiple copies of the vector up to about 500 
copies per cell (see Table 2). All of the SV2.GS cell lines also contain vector DNA but in all three cases there 
seems to have been integration of only a single copy of vector DNA per cell. 

It is to be noted that the result obtained with pSVLGS.1 is highly unexpected. Up until the present there 
has been no reported case In which such a high copy number has been produced merely by transfection. It is 
20 believed that this high copy number is due to the presence in the vector of a DNA sequence which favours the 
incorporation of high numbers of copies of vector DNA Into the host cell's DNA. 

Such high copy numbers of integrated vectors have not been observed with pSVZGS. It is therefore 
believed that DNA sequences partly responsible for the high copy number transfection are found either In the 
Intron or in the 3' region of the genomic GS DNA part of the pSVLGS.1 vector or adjoining vector sequences. 
25 However, the copy number probably also reflects the expression level required to attain resistance to the par- 
ticular level of Msx used for selection. 

Clearly, this high copy number transfection sequence wQI be of use not only with GS encoding sequences 
but also with other protein sequences, such as those encoding selectable markers or amplifiable genes 
because it provides a means of increasing copy number and hence expression levels of desired genes 
so additional to the effects of selection for further gene amplification. 

Therefore according to a further aspect of the invention there Is provided the recombinant DNA sequence 
present in the pSVLGS.1 vector which is responsible for achieving high copy number transfection of vector DNA 
Into a host cell or any other recombinant DNA sequence which will provide the same function. 

The 5' ends of GS mRNA produced by Msx-resistant cell lines were analysed by primer extension analysis. 
35 A synthetic oligomer 1 9 bases in length was synthesised which hybridises to a region of the mRNA near the 
start of the protein coding region. Reverse transcriptase should extend this primer to a length of 146 bp from 
wild type GS mRNA and to a length of approximately 400 bp to the start of transcription in the case of pSVLGS.1 
mRNA. The RNA predicted from pSV2.GS is shorter than the natural mRNA and so could be masked by •drop- 
offs" In the primer extension reaction and was not analysed. 
40 The results shown In Figure 5 show that a GS specific mRNA longer than wild-type mRNA is indeed pro- 
duced in SVLGS cell lines, strongly supporting the conclusion that the transfected gene is transcribed in these 
cells. The reverse transcriptase does not extend the primer to the predicted length, but seems to drop off at at 
least 3 major sites, probably due to Inhibition of reverse transcription by secondary structure in the 5* untrans- 
lated region of this RNA. 

45 Three Msx-resistant cell lines transfected with pSVLGS.1 and three cell lines transfected with pSVZGS 
were grown in various concentrations of Msx In order to select for GS gene amplification. For each cell line, 
approximately 10* ceils were plated in 100 pM, 250 pM, 500 pM and 1 mM Msx. After 12 days, the maximum 
concentrations of Max at which surviving colonies could be observed in each cell line were as follows : SVLGS2, 
500 pM ; SVLGS5. 250 pM ; SVLGS9, 500 pM ; SV2.GS20, 100 pM ; SV2.GS25, 500 pM ; and SV2.GS30, 

so 500 pM. The most highly resistant colonies obtained from each cell line were pooled and two of these Msx- 
resistant pools were subjected to a second round of amplification. SVLGS2 (500 pMR) and SV2.GS30 (500 
pMR) were plated out at 1 mM, 5mM, 10 mM and 20 mM Msx. After 15-20 days, colonies appeared on plates 
containing SVLGS2 (500 pMR) at up to 2 mM Msx and in the case of SV2.GS (500 pMR) at up to 1 0 mM Msx. 
From these, two highly resistant cell lines SVGS2 (2 mMR) and SV2.GS30 (10 mMR) were established. Each 

55 of these highly resistant cell lines contain cells which have arisen from multiple independent amplification 
events. 

A Southern blot of DNA prepared from all of the Msx-resistant cell lines was hybrised with a probe specific 
for SV40 ORl-reglon DNA. The results of this are shown in Figure 3. From a comparison with standard prepa- 
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rations of plasmld DNA, the copy numbers could be determined and these are shown In Table 2. 

After the first round of selection, all three SVLGS cell lines show approximately a 10-fold increase in copy 
number of the vector DNA. 



TABLE 2 



10 



Copy Number of Trans fected Genes Subjected to 
Selection for gene amplification 



1S 



Cell Line 



Cone, of 
Msx (MM) 



Copy Number 



20 



SVLGS 2 
SVLGS 5 
SVLGS9 



20 
20 
30 



170 
25 
500 



25 



SVLGS2(500 fJKR) 500 
SVLGS5(250 (MR) 250 
SVLGS9(500 fOSR) 500 



1200 
300 
4200 



90 



SVLGS2(2 mMR) 

SV2.GS20 
SV2.GS25 
SV2.GS30 



2000 

20 
25 
30 



15000 

1 
1 
1 



40 



SV2.GS2 0(100 MMR) 100 

SV2.GS25(500 MMR) 500 

SV2.GS3 0(500 mMR) 500 

SV2.GS30(10 mMR) 10,000 



1 
1 
1 

5-10 



45 In the second round of selection, SVLGS2 shows at least a further 1 0 fold amplification attaining approxim- 
ately 15,000 copies/cell. 

In marked contrast, the single copy of pSV2.GS present in initial transfectants is not significantly Increased 
after a single round of selection and SV2.GS30 (10 mMR) resistant to 10 mM Max contains only 5-10 copies 
of the vector in each ceO. 

to In order to determine whether there has also been amplification of the endogenous GS genes, the probe 
was removed and the blot re-probed with a nick- translated Bgll-Bglll DNA fragment obtained from the third 
intron of the GS genomic sequences. This probe is therefore specific for endogeneous GS genes and does 
not hybridise with the transfectsd genes which lack this intron. No significant endogenous gene amplification 
could be detected by this means in SVLGS cell lines. A small degree of endogenous amplification could be 

05 seenlnSV2.GS30(10mMR)eellDNA. 

Thus pSV2.GS, whle acting as an effective dominant selectable marker in CHO-K1 cells, appears to exp- 
ress GS too efficiently to be suitable as an ampfifiable marker, since very high levels of Msx are required In 
order to selectfbr even slightly increased copy number. pSVLGS.1 on the other hand can be used as a dominant 
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selectable marker and can also be amplified to very high copy numbers. 

The suitability of pSVLGS.1 as a selectable and amplrfiaWe vector was tested by introducing into it a tran- 
scription unit capable of expressing tissue plasminogen activator (tPA). Two plasmids were examined in which 
tPA cDNA under the control of the SV40 early region promoter and polyadenylation signal was cloned at the 

5 unique BamHI site of pSVLGS.1. In pSVL.GS.tPA1 6, the GS and tPA genes are in the same orientation and in 
pSVL.GS.tPA1 7, the two genes are in opposite orientations. 

Both constructions were Introduced Into CKO-K1 cells and transfected cells were selected for resistance 
to 15 uM Msx. After 10 days, the surviving colonies were screened for tPA activity by fibrin overlays. Many of 
the surviving colonies secreted tPA, thus confirming that the GS gene could act as a selectable marker to iden- 

10 tffy transfected clones. The tPA-lnduced clearings in the fibrin gel were larger and more numerous on plates 
transfected with pSVGS.tPA 1 6, Indicating that the tPA gene was more efficiently expressed when in the same 
orientation in the vector as the GS gene than when the two genes were in opposite orientations. 10 colonies 
from e transection with pSVL.GS.tPA1 6, which produced large tPA clearings, were grown in 96-well plates. Of 
these, the two cell lines secreting the highest levels of tPA, 16-1.20 uMR and 16-£20 uMR were selected for 

15 further study. Each was subjected to selection in increased concentrations of Msx and the tPA production from 
pools of colonies obtained at different stages is shown in Table 3. 



20 



30 



XftPLF 3 

Cell line tPA secreted fU/10 6 cells/24 hQWTg) 

16-1.20 JiMR 260 

16-1.200 MMR 2700 

16-2.20 MMR 40° 

16-2.200 MMR 2750 

16-2.10 mMR 4000 



16-2.10 mMR, the cell line producing the highest levels of tPA, was cloned by limiting dilution and a done 

35 was Isolated which secreted 4000 U/10* cells/day. This level is comparable with the highest level of tPA exp- 
ression reported using DHFR co-amplification. 

It has thus been shown that, when a GS cDNA cloned in the retrovirus based vector pZJP-Neo S V(X) was 
used, the frequency with which Msx-resistant colonies arose was low, probably due to relatively inefficient exp- 
ression from this vector In this cell line. On the other hand, two different constructs in which the GS gene was 

40 under the control of SV40 promoters gave rise to cells resistant to substantially higher levels of Msx than wild- 
type cells. All of the resistant colonies tested contained vector DNA, and novel GS mRNAs consistent with tran- 
scription of the transfected genes could be detected in cell lines containing pSVLGS.1 DNA. Msx-resistant 
colonies could be identified using both GS expression plasmids using SV40 promoters at a frequency greater 
than 1/10 6 cells, indicating that both constructs could be useful as dominant selectable markers for the intro- 

45 duction of cloned DNA Into CHO-K1 ceils. 

The expression plasmid pSVLGS.1 containing a GS minigene utilising its own RNA processing signals and 
under the control of an SV40 late promoter, can unexpectedly be used to introduce a high number of copies 
of the vector into each transfected cell. 

Both GS genes under the control of SV40 promoters were capable of further amplification when transfected 

so ceil lines were selected In higher concentrations of Msx. Ceil lines expressing pSV2.GS yielded variant clones 
resistant to very high levels of Msx (up to 65 times higher than originally used to select transfectants) with an 
increase in copy number to only 5-1 0 per cell. There was little detectable concomitant amplification of endogen- 
ous genes. 

pSVLGS.1 is 8 much more suitable ampIKiabte vector since the increase in copy number was roughly pro- 
55 portions! to the concentration of Msx and very high copy numbers were achieved (approximately 1 0,000 copies 
per cell in ceDs resistant to 2 mM Msx). In this case, no detectable endogenous gene amplification occurred. 

ThepSVLGS.1 amplifiable vector has been used to introduce a tPA gene tntoCHO-K1 cells and It has been 
shown that gene amplification leads to higher levels of tPA expression. Variant clones resistant to ten times 
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the concentration of Max of the original transfectants secrete about ten times the amount of tPA, but a further 
50 fold increase In Msx-resistance led to less than a 2 fold Increase In tPA secretion. This suggests that some 
aspect of the synthesis or secretion of tPA is dose to saturation in these highly Msx-resistant cells. The 
maximum level of tPA secretion of 4000 U/10* cells/day in the 1 6-2.10 mMR cell line is comparable with the 

5 levels of expression previously observed in dhfr CHO ceils using DHFR-mediated gene amplification, the high- 
est reported level of secretion being 6000 U/10* cells/day. This also supports the conclusion that tPA secretion 
is close to the maximum attainable by current methods In these cells. 

It will be appreciated that the present Invention is described above purely by way of illustration and that 
modifications and variations thereof may be made by the person skilled in the art without departing from the 

10 spirit and scope thereof as defined in the appended claims. 



Claims 

15 1 . A method for co-amplifying a recombinant DNA sequence which encodes the complete amino acid sequ- 
ence of a desired protein other than a glutamine synthetase (GS), which method comprises : 



(a) providing a vector capable, in a transformant host cell, of expressing both a recombinant DNA sequence 
20 which encodes an active GS enzyme and the recombinant DNA sequence which encodes the complete 

amino acid sequence of the desired protein other than GS ; 

(b) providing a eukaryotic host ceO which is a glutamine prototroph ; 

(c) transforming said host cell with said vector ; and 

(d) culturing said host cell under conditions which allow transformants containing an amplified number of 
25 copies of the vector-derived GS-encoding recombinant DNA sequence to be selected, which transfor- 
mants also contain an amplified number of copies of the desired protein-encoding DNA sequence. 



2. A method for co-amplifying a recombinant DNA sequence which encodes the complete amino acid sequ- 
30 ence of a desired protein other than a GS, which method comprises : 



(a) providing a first vector capable, in a transformant host cell, of expressing a recombinant DNA sequence 
which encodes an active GS enzyme ; 
35 (b) providing a second vector capable, in a transformant host cell, of expressing the recombinant DNA sequ- 
ence which encodes the complete amino acid sequence of the desired protein other than GS ; 

(c) providing a eukaryotic host cell which Is a glutamine prototroph ; 

(d) transforming said host ceil with both said first and said second vectors ; and 

(e) culturing said host cell under conditions which allow transformants containing an amplified number of 
40 copies of the vector-derived GS-encoding recombinant DNA sequence to be selected, which transfor- 
mants also contain an amplified number of copies of the desired protein-encoding DNA sequence. 



3. The method of claim 1 or claim 2, wherein the culturing step [(d) or (e), respectively] comprises culturing 
45 the transformed host cell in media containing a GS inhibitor and selecting for transformant cells which are resis- 
tant to progressively increased levels of the GS Inhibitor. 

4. The method of claim 3, wherein the GS inhibitor is phosphinothricin or methionine sulphoximine. 

5. The method of claim 3 or claim 4, wherein the media containing the GS inhibitor also contain methionine, 
whereby the concentrations of GS inhibitor in the media can be reduced. 

so 6. The method of any one of claims 1 to 5, wherein the GS-encoding recombinant DNA sequence is under 
the control of a regulatable promoter. 

7. The method of claim 6, wherein the regulatable promoter is a heat shock promoter or a metallothioneln 
promoter. 

8. The method of claim 6 or daim 7, wherein the regulatable promoter Is up-regulated during the culturing 
55 and selecting steps and Is down-regulated after selection. 

9. The method of any one of claims 1 to 8, wherein the desired protein is tissue plasminogen activator. 

10. The method of any one of claims 1 to 9, wherein the host cell is a mammalian cell. 

1 1. The method of claim 10, wherein the host cell Is a CHO-K1 cell. 
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12. A method for using a vector as a dominant selectable marker in a cotra reformation process which com- 
prises: 



* (a) providing a vector capable, in a transfbimant host cell, of expressing a recombinant DNA sequence 
which encodes an active OS enzyme and a recombinant DNA sequence which encodes the complete 
amino acid sequence of a desired protein other than GS ; 

(b) providing a eukaryotic host cell which is a gtutamine prototroph ; 

(c) transforming the host cell wfth the vector ; and 

10 (d) selecting transformant cells which are resistant to GS inhibitors, 

whereby transformant cells are selected In which the vector-derived GS-encoding sequence serves as 
a dominant selectable and co-emp!rfiable marker. 
13. A method for using a vector as a dominant selectable marker in a cotransformation process which com- 
16 prises : 



(a) providing a vector capable, in a transformant host cell, of expressing a recombinant DNA sequence 
which encodes an active GS enzyme ; 
20 (b) providing a second vector capable, in a transformant host cell, of expressing a recombinant DNA sequ- 
ence which encodes the complete amino acid sequence of a desired protein other than GS ; 

(c) providing a eukaryotic host which is a glutamlne prototroph ; 

(d) transforming said host cell with both said first and second vectors ; and 

(e) selecting transformant cells which are resistant to GS Inhibitors, 

25 

whereby transformant cells are selected in which the vector-derived GS-encoding sequence serves as 
a dominant selectable co-amplifiable maiker. 
14. A recombinant DNA vector comprising : 



30 

(a) a recombinant DNA sequence which encodes an active GS enzyme ; and 

(b) a recombinant DNA sequence which encodes the complete amino acid sequence of a desired protein 
other than GS, 

35 the vector being capable, in a transformant host cell, of expressing both said recombinant DNA sequ- 

ences (a) and (b). 

1 5. A plasm W containing a GS mlnlgene, said minigene comprising a cDNA fragment having the sequence 
of residues 1 to 753 of the cDNA sequence shown in Figure 2 and a 3.4 kb EcoRI — Ssti fragment of hamster 
genomic DNA which encodes mRNA corresponding the sequence of residues 754 to 1421 of the cDNA sequ- 

40 ence shown in Figure 2, the 3' end of the cDNA fragment being fused directly to the 5' end of the genomic frag- 
ment 

16. A plasmid containing an SV40 late promoter fused upstream of a GS minigene as defined in claim 15 
such that the SV40 late promoter is capable of directing transcription of an mRNA encoding GS. 



45 

PatontansprQcho 

1 . Verfahren zur Co-Amplffizlerung einer rekombinanten DNA-Sequenz, die fur die komplette Amlnosfiu- 
resequenz eines gewOnschten Proteins, das nicht sine Glutaminsynthetase (GS) 1st, codiert, welches Verfah- 
so ran umfafit : 



a) Bereiteteilung eines Vektors, der imstande 1st, in einer transformanten Wirtszelle sowohl eine rekombi- 
nante DNA-Sequenz, die fur em aktives GS-Enzym codiert, als auch die rekomblnante DNA-Sequenz, 

55 die fur die komplette Aminosiuresequenz des gewOnschten Proteins, das keln GS 1st, codiert, zu expri- 

mieren, 

b) Bereiteteilung einer eukaryotischen Wirtszelle, die ein Glutamin-Prototroph 1st, 
e) Transform ieaing dieser Wirtszelle mlt dem genannten Vektor und 
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d) Zuchtung der Wirtszelle unter Bedingungen, die Transformanten ermdglichen, die eine vermehrte 
Anzah) von Kopien der zu selektierenden, von dem Vektor abgeleiteten, fQr GS codierenden, rekombk 
nanten DNA-Sequenz enthalten, wobeJ diese Transformanten auch eine vermehrte Anzahl von Kopien 
der fQr das gewOnschte Protein codierenden DNA-Sequenz enthalten. 



Z Verfahren zur Co-Ampltfeierung einer rekombinanten DNA-Sequenz, die fur die komplette AmlnosSu- 
resequenz eines gewflnschten Proteins, das kein GS 1st, codiert, welches Verfahren umfafit : 

10 

a) Bereitstellung eines ersten Vektors, der imstande 1st, in einer transformanten Wirtszelle eine rekombl- 
nante DNA-Sequenz zu exprimieren, die fur ein aktrves GS-Enzym codiert, 

b) Bereitstellung eines zweften Vektors, der imstande 1st, in einer transformanten Wirtszelle die rekombi- 
nante DNA-Sequenz zu exprimieren, die fur die komplette AminosSuresequenz des gewunschten Pro- 
fs teins, das kein GS 1st, codiert, 

c) Bereitstellung einer eukaryotischen Wirtszelle, die ein Glutamin-Prototroph 1st, 

d) Transformation dieser Wirtszelle sowohl mlt dem ersten als auch mlt dem zweiten Vektor, 

e) ZOchtung dieser Wirtszelle unter Bedingungen, die Transformanten ermdglichen, die eine vermehrte 
Anzahl von Kopien der zu selektierenden,von dem Vektor abgeleiteten, fflr GS codierenden, rekombi- 

20 nanten DNA-Sequenz enthalten, weiche Transformanten auch eine vermehrte Anzahl von Kopien der 

fur das gewunschte Protein codierenden DNA-Sequenz enthalten. 



3. Verfahren nach Anspruch 1 Oder 2, bei welchem der ZOchtungsschritt [(d) bzw. (e)] die ZOchtung der 
25 tranfbrmierten Wirtszelle in Medien, die einen GS-lnhlbitor enthalten, und die Selektierung von transformanten 

Zellen, die gegenQber progessrv ansteigenden Gehalten des GS-lnhibitors resistent sind, umfa&t 

4. Verfahren nach Anspruch 3, bei welchem der GS-lnhibitor Phosphlnothricln oder Methlonlnsulfoximin 

1st 

5. Verfahren nach Anspruch 3 oder 4, bei welchem die Medien, die den GS-lnhibitor enthalten, auch Methio- 
30 nin enthalten und die Konzentrationen des GS-lnhibitors in den Medien herabgesetzt warden konnen. 

6. Verfahren nach einem der Ansprflche 1 bis 5, bei welchem die fur GS codierende rekombinante DNA- 
Sequenz unter der Steuerung eines regulierbaren Promoters steht 

7. Verfahren nach Anspruch 6, bei welchem der regulierbare Promoter eine Hitzeschock-Promotor oder 
ein Metallothionein-Promotor 1st 

35 8. Verfahren nach Anspruch 6 oder 7, bei welchem der regulierbare Promoter wShrend der ZOchtungs- und 
Selektionsstufen aufw&rtsreguliert und nach der Selektlon abwirtsreguliert 1st 

9. Verfahren nach einem der Ansprucrte 1 bis 8, bei welchem das gewunschte Protein ein Gewebe-Plas- 
minogen-Aktivator 1st 

10. Verfahren nach einem der AnsprOche 1 bis 9, bei welchem die Wirtszelle eine Sfiugetierzelle 1st 
40 11. Verfahren nach Anspruch 10, bei welchem die Wirtszelle eine CHO-K1-Zelle 1st 

1 2. Verfahren zur Verwendung eines Vektors als dominant sel ektierbarer Marker in einem CoTransfbma- 
tionsverfahren, welches umfattt : 



45 a) Bereitstellung eines Vektors, der imstande 1st in einer transformanten Wirtszelle eine rekombinante 
DNA-Sequenz, die fQr ein aktives GS-Enzym codiert, und eine rekombinante DNA-Sequenz, die fur eine 
komplette Aminosfiuresequenz eines gewunschten Proteins, das kein GS 1st, codiert, zu exprimieren, 

b) Bereitstellung einer eukaryotischen WktszeDe, die ein Glutamin-Prototroph ist, 

c) Transform ierung der Wirtszelle mlt dem Vektor und 

so d) Seiektion der transformanten Zellen, die gegenQber GS-lnhlbitoren resistent sind, 

wobei transfbrmante Zellen selektiert werden, in welchen die von dem Vektor abgeleitete GS-codie- 
rende Sequenz als ein dominant selektierbarer und co-amplifizierbarer Marker dtent 
13. Verfahren zur Verwendung eines Vektors als dominant selektierbarer Marker in einem Co-Transfbr- 
55 mationsverfahren, welches umfaBt : 



a) Bereitstellung eines Vektors, der imstande ist, In einer transformanten Wirtszelle eine rekombinante 
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DNA-Sequenz zu axprtmieren, die fur ein aktives GS-Enzym codiert, 

b) Bereltstellung eines zwerten Vektors, der imstande 1st, In einer transformanten Wirtszelle eine rekom- 
blnante DNA-Sequenz zu exprimleren, die fur ein gewGnschtes Peptid, das keln GS 1st, codiert 

c) Bereltstellung eines eukaryotischen Wirts, der ein Glutamin-Prototroph ist, 

5 d) Transformierung dieser Wirtszelle mit sowohl dem ersten ais auch dem zweiten Vektor und 
e) Selektion der transformanten Zellen, die gegen GS-lnhMoren resistent sind, 

wobei transfbrmante ZeQen selekfiert warden, In welchen die Vektor-abgeleitete GS-codierende 
Sequenz ais ein dominant seiektierbarer co-amplifizierbarer Marker dient 
10 14. Rekomblnanter DNA-Vektor, der enthalt : 



a) eine rekombinante DNA-Sequenz, die fflr ein aktives GS-Enzym codiert, und 

b) eine rekombinante DNA-Sequenz, die fQr die komplette Aminosfturesequenz eines gewunschten Pro- 
is tains, das kein OS 1st, codiert 

wobei der Vektor imstande 1st, in einer transformanten WirtszeJIe beide genannten rekombinanten DNA- 
Sequenzen a) und (b) zu exprfmieren. 

15. Plasmid enthaltend ein GS-Minigen, welches seinerseits ein cDNA-Fragment mit der Sequenz der 
20 Reste 1 bis 753 der In Rg. 2 gezelgten cDNA-Sequenz und ein 3, 4 kb EcoRISsti-Fragment von Hamsterge- 

nom-DNA, das fDr mRNA codiert, die der Sequenz der Reste 754 bis 1421 der in Fig. 2 gezeigten cDNA-Se- 
quenz entspricht, umfaSt, wobei das 3'-Ende des cDNA-Fragments direkt an das 5'-Ende des genomischen 
Fragments fusioniert IsL 

16. RasmkJ enthaltend einen spiten SV40-Promotor, der stromaufwarts eines GS-Minigens nach der 
25 Definition von Anspruch 15 fusioniert 1st, sodaB der spite SV40-Promotor die Transcription einer fQr GS codie- 

renden mRNA zu leiten Imstande ist 



Revendicatlons 

1. Precede de co-amplification d v une sequence d'ADN recombinant qui code pour la sequence complete 
d'amlno-acldes d'une proteine desiree autre qu'une glutamine synthetase (GS), lequei precede comprend : 



35 (a) Putll teation d'un vecteur capable, dans une cellule hdte transformante, d'exprimer a la fob une sequence 
d'ADN recombinant qui code une enzyme GS active de la sequence d'ADN recombinant et qui code 
pour la sequence complete d'amino-acldes de la proteine desiree autre que la GS ; 

(b) ^utilisation (Tune cellule h6te eucaryote qui est un prototrophe de glutamine ; 

(c) la transformation de ladite cellule hfite par ledft vecteur ; et 

40 (d) la culture de ladite cellule hdte dans des conditions qui permettent a des transformants contenant un 
nombre amplifie de copies de la sequence d'ADN recombinant derives du vecteur, codant pour la GS 
d'etre selectionnes, iesquels transformants contiennent egalement un nombre amplifie de copies de la 
sequence d'ADN codant pour la proteine desiree. 

45 

2. Precede de co-amplification d'une sequence d'ADN recombinant qui code pour la sequence complete 
d'amino-acides d'une proteine desiree autre qu'une GS, lequei precede comprend : 



50 (a) rutili8ation<run premiervecteurcapable.dansunecellule h6te transformante, d'exprimer une sequence 
d'ADN recombinant qui code pour une enzyme GS active ; 
(b) rutilisation d'un second vecteur capable, dans une cellule hdte transformante, d'exprimer la sequence 
d'ADN recombinant qui code pour la sequence complete d'amino-acldes de la proteine desiree autre 
que le GS ; 

55 (c) rutilisation d'une cellule h6te eucaryote qui est un prototrophe de glutamine ; 

(d) la transformation de ladite collide hOte a la fois par ledit premier et ledlt second vecteurs ; et 

(e) la culture de ladite cellule hflte dans des conditions qui permettent a des transformants contenant un 
nombre amplifie de copies de la sequence d'ADN recombinant derfvee des vecteurs, codant pour la GS 
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d'etre s6lectionn6s, lesquels transformants oontiennent egalement un nombre amplifie de copies de ta 
sequence d'ADN codant pour la proline d6sir6e. 



5 3. Proc6d6 selon la revendication 1 ou la revendication 2, dans lequel I'etape de culture [(d) ou (e), res- 
pectivement] comprend la culture de la cellule hdte transform^ dans des mflieux contenant un inhlbiteur de 
la GS et la selection en ce qui conceme des cellules transformantes qui sont r6stetantes 6 des concentrations 
augmentees progress h/ement & Hnhibiteur de la GS. 

4. Proc6d6 selon la revendication 3, dans lequel I'inhlbltaur de la GS est la phosphinothricine ou la sul- 
10 foximine de la methionine. 

5. Proc6d6 selon la revendication 3 ou la revendication 4, dans iequel les milieux contenant I'inhibiteur de 
la GS contiennent 6ga!ement de la methionine, ce grfice 6 quo! les concentrations de Pinhibiteur de la GS dans 
les milieux peuvent fitre rtduites. 

6. Proc6d6 selon I'une quelconque des revendlcatlons 1 ft 5, dans lequel la sequence d'ADN recombinant 
is codant pour la GS est sous le contrdle (Tun promoteur r6glable. 

7. Proc6d6 selon la revendication 6, dans lequel le promoteur r6glable est un promoteur de type choc ther- 
mique ou un promoteur de type m6tallothion6ine. 

8. Precede selon la revendication 6 ou la revendication 7, dans lequel le promoteur r6g!able est regie vers 
le haut pendant les etapes de culture et de selection et est regie vers le bas aprfes la selection. 

20 9. Precede selon I'une quelconque des revend Jcations 16 8, dans lequel la proteine d6slr6e est I'activateur 
tissulaire du plasminogdne. 

10. Precede selon I'une quelconque des revend ications 16 9, dans lequel la cellule hdte est une cellule 
de mammif&re. 

11. Precede selon la revendication 10, dans lequel la cellule hdte est une cellule de CHO-K1. 

25 12. Precede d'utiisation d'un vecteur comme marqueur s6lectionnabie dominant dans un processus de 
co-transformation qui comprend : 



(a) 1'utUIsation d'un vecteur capable, dans une cellule hdte transformante, d'exprimer une sequence d'ADN 
30 recombinant qui code pour une enzyme GS active et une sequence d'ADN recombinant qui code pour 

la sequence complete d'amino-acldes d*une proteine d6sir£e autre que la GS ; 

(b) futilisation (Tune cellule hdte eucaryote qui est un prototrophe de glutamine ; 

(c) la transformation de la cellule hdte par le vecteur ; et 

(d) la selection de cellules transformantes qui sont r6sistantes 6 des inhibiteurs de la GS, 

35 

ce grfice 6 quoi sont s6lectionn6e9 des cellules transformantes dans lesquelles la sequence d6riv6e 
du vecteur, codant pour la GS sert de marqueur seiectionnable dominant et co-amp!ifiable. 
13. Precede d'utilisation d'un vecteur comme marqueur seiectionnable dominant dans un processus de 
co-transformation qui comprend : 

40 



(a) r utilisation (fun vecteur capable, dans une cellule hdte transformante, d'exprimer une sequence d'ADN 
recombinant qui code pour une enzyme GS active ; 

(b) I'utnisation d'un second vecteur, dans une cellule hdte transformante, d'exprimer une sequence d'ADN 
45 recombinant qui code pour la sequence complete cfamino-acides d'une proteine d6sir6e autre que la 

GS; 

(c) rutilisatjon d'une cellule hdte eucaryote qui est un prototrophe de glutamine ; 

(d) la transformation de ladite cellule hdte par 6 la fois lesdits premier et second vecteurs ; et 

(e) la selection de cellules transformantes qui sont rtslstantes 6 des inhibiteurs de la GS, 

80 

ce grflce 6 quoi sont seiectionn6es des cellules transformantes dans lesquelles la sequence d6riv6e 
des vecteurs, codant pour la GS sert de marqueur seiectionnable dominant, co-amplifiable. 
14. Vecteur ADN recombinant comprenant : 



55 



(a) une sequence d'ADN recombinant qui code pour une enzyme GS active ; et 

(b) une sequence d'ADN recombinant qui code pour la sequence complete cfamlno-acides d'une proteine 
d6slree autre que la GS, 
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le vectour 6tant capable, dans una cellule hflte transfonnante, d'exprtmer Tune et I'autre desdltes 
sequences (a) et (b) d'ADN recombinant 

15. Plasmlde contBnant un minigftne de OS, ledit minigftne comprenant un fragment d'ADNc ayant la 
s sftquence des rftsidus 1 ft 753 de la sftquence <f ADNc representee ft la figure 2 et un fragment EcoRI-SstJ de 

3, 4 kb de I'ADN g6nomlque du hamster qui code pour TARNm correspondant ft la sequence des rftsidus 754 
ft 1421 de la sequence d'ADNc representee ft la figure 2, rexfrftmltd 3' du fragment d'ADNc fttantfustonnfte 
dlrectement ft rextrftmite 5' du fragment gftnomlque. 

16. Plasmlde contenant un promoteurtardlf de SV40fuslonn6 en amontd'un mlnigftne de GS tel que dftfinl 
10 ft la revendication 15, de telle aorta que le promoteurtardlf de SV40 sort capable de diriger la transcription d'un 

ARNm codant pour la GS. 
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